Biologically plausible saliency mechanisms improve feedforward object recognition

نویسندگان

  • Sunhyoung Han
  • Nuno Vasconcelos
چکیده

The biological plausibility of statistical inference and learning, tuned to the statistics of natural images, is investigated. It is shown that a rich family of statistical decision rules, confidence measures, and risk estimates, can be implemented with the computations attributed to the standard neurophysiological model of V1. In particular, different statistical quantities can be computed through simple re-arrangement of lateral divisive connections, non-linearities, and pooling. It is then shown that a number of proposals for the measurement of visual saliency can be implemented in a biologically plausible manner, through such re-arrangements. This enables the implementation of biologically plausible feedforward object recognition networks that include explicit saliency models. The potential of combined attention and recognition is illustrated by replacing the first layer of the HMAX architecture with a saliency network. Various saliency measures are compared, to investigate whether (1) saliency can substantially benefit visual recognition and (2) the benefits depend on the specific saliency mechanisms implemented. Experimental evaluation shows that saliency does indeed enhance recognition, but the gains are not independent of the saliency mechanisms. Best results are obtained with top-down mechanisms that equate saliency to classification confidence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Object recognition with hierarchical discriminant saliency networks

The benefits of integrating attention and object recognition are investigated. While attention is frequently modeled as a pre-processor for recognition, we investigate the hypothesis that attention is an intrinsic component of recognition and vice-versa. This hypothesis is tested with a recognition model, the hierarchical discriminant saliency network (HDSN), whose layers are top-down saliency ...

متن کامل

A model of proto-object based saliency

Organisms use the process of selective attention to optimally allocate their computational resources to the instantaneously most relevant subsets of a visual scene, ensuring that they can parse the scene in real time. Many models of bottom-up attentional selection assume that elementary image features, like intensity, color and orientation, attract attention. Gestalt psychologists, however, arg...

متن کامل

Shetty, Sanketh v. a Biologically Plausible Architecture for Shape Recogni- Tion. (under the Direction of Professor a Biologically Plausible Architecture for Shape Recognition

SHETTY, SANKETH V. A Biologically Plausible Architecture for Shape Recognition. (Under the direction of Professor Wesley E. Snyder). This thesis develops an algorithm for shape representation and matching. The algorithm is an object centered, boundary-based method for shape recognition. Global features of the shape are utilized to define a frame of reference relative to which local shape featur...

متن کامل

NIMBLER: A Model of Visual Attention and Object Recognition With a Biologically Plausible Retina

NIMBLE is a cognitively plausible object recognition system that uses a saccadic visual memory to store and retrieve image fragments. These fragments are acquired by scanning an image in a human-like way using a bottom up saliency model to find informative regions, applying a kernel density function to the fragment to determine its familiarity, and then combining the fragments using naïve Bayes...

متن کامل

Visual saliency computations: mechanisms, constraints, and the effect of feedback.

The primate visual system continuously selects spatial proscribed regions, features or objects for further processing. These selection mechanisms--collectively termed selective visual attention--are guided by intrinsic, bottom-up and by task-dependent, top-down signals. While much psychophysical research has shown that overt and covert attention is partially allocated based on saliency-driven e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Vision Research

دوره 50  شماره 

صفحات  -

تاریخ انتشار 2010